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UPA, A UNIVERSAL PROTEIN ARRAY SYSTEM 

FIELD 

The present invention relates to detection of interactions between polypeptide and protein, 
DNA, RNA and/or iigand molecules. 

BACKGROUND 

Gene expression in eukaryotic cells is controlled by numerous ftindamental and selective 
protein-protein. protein-DNA, protein-RNA and protein-ligand interactions. Cancer, as well as other 
genetic diseases, results from abnormal gene expression. Interactions of proteins with proteins and 
other biomolecules play a pivotal role in almost every aspect of gene expression. Therefore, factors 
involved in these interactions, including transcription factors, signal transduction factors, growth 
factors and the products of other oncogenes, nimor suppressor genes, viral genes and many cellular 
genes, have been implicated as potential targets for new drugs (Hurst, Eur. J. Cancer, 32A, 1 857- 
1863. 1996; Bustin and McKay, Br. J. Biomed. &/., 51, 147-157. 1994; Powis. Pharmac. Ther, 62. 
57-95, 1994; Krantz, Nature BioiechnoL, 16, 1294, 1998). 

Use of transcription factors has proved to be a successful means to identify new drug targets 
in cancer and other human disease. The basal transcription machinery of class 11 genes consists of at 
least six general transcription fectors, including TFIIB, TFIID, TFIIE. TFIIF, TFIIH and RNA 
polymerase II. However, an additional activator(s) and coactivator(s) are required for regulated 
(activated) transcription (Orphanides ei al.. Genes Dev., 10, 2657-2683, 1996; Ptashne and Gann. 
Nature, 386. 569-577, 1997). Both basal and activated transcriptions are controlled largely through 
protein-protein interactions between transcription factors and through prolein-DNA interactions. 
Thus, insight into factor communication holds not only the key to understanding mechanisms of gene 
regulation, but also provides a means of understanding mechanisms of pathogenesis and of 
identifying anticancer drugs. 

At present, in addition to the two-hybrid system and co-immunoprecipitation assays usually 
used to detect protein-protein interactions in vivo, the glutathione 5-transferase (GST) pull-down 
assay is one of the more common methods to detemine specific protein-protein interactions in vitro. 
Cross-linking, gel mobility shift, footprinting and others have been often used to study protein-DNA 
and protein-RNA interactions (Fields and Stemglanz, Trends Genet., 10. 286-292, 1994; Harris, 
Methods Mol. Biol., 88, 87-99, 1998). Recently, several methods, including serial analysis of gene 
expression (SAGE) (Velculescu et al.. Science, 270, 484-487. 1995). cDNA microarrays (Schena et 
al.. Science, 270. 467-470. 1995) and oligonucleotide-based DNA chips (Chee e, al.. Science, 274, 
610-614. 1996), have been employed to study the relationship between gene expression and cancer 
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SUMMARY 

i.cow,,. 1 . ^"s^ce biomo ecules. UPA can hp 

------ — ,..tr™~:r,- 

protein content of the preparation. 

embodiments, the support is a olasssliHp ru;. im H«aicuiar 
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from the following group of polypeptides: TFIIA, TFIIB, TBP, fTFIID. TFIIE, TFIIF, fTFIIH Pol 
H, RXR. TR. Octl. Spl. G4-94. G4-147. G4-AH, G4-VPI6. G4-CTF, G4-Spl. G4-EI A G4-IE 04- 
Tat, PC4-P. PC4-N. PC4-C. PC4-AS. PC4-n,l. PC4.m2. PC4-m3. PC4.m4. PC4.m5. PC4-m6. PC4- 
m7. PC4-wt. p52. p75. p75.C. p300-C. PCAF. PCAF-C. TAF250. Topo I (wt), Topo I (mt). Topo 1 
(wt)«. Topo 1 (nati). ASF. SR. GST-Nu. and GST-K. This represents specific but non-limiting 
examples of certain proteins that may be presented as targets on an airay of this invention. 
The present invention also provides assays employing these arrays. 
Cenain embodiments of the invention are array-based protein interaction assays, wherein an 
array (either a macro- or microarray) of target polypeptide molecules is conucted with a detectable 
probe molecule under conditions sufficiem to produce binding (eg., a binding pattern). Binding can 
then be detected. In certain embodiments, the polypeptides of the array are substamially pure 
preparations of polypeptide. Polypeptides may for example be stably associated with the surface of 
the array, which may be a solid support. Examples of such assays include a funher step of removing 
unbound probe molecule(s) prior to detecting the binding panem of the probe. 

Probes for use with assays of this invention can be any molecules that might bind to a 
polypeptide. Examples of probes include single-stranded nucleic acids (DNA or RNA). double- 
stranded nucleic acids (DNA or RNA). proteins, and ligands (e.g.. drugs, toxins, venoms, honnones. 
co-factors, substrates or reaction products of enzymatic reactions or analogs thereof, transition state 
analogs, minerals, and so forth). Such probes are detectable, either due to inherent features of the 
probe (such as immunogenicity. which can be detected through interaction with an antibody) or 
through the attachment or association of a label or tag molecule. Examples of tags include 
fluorescent tags, luminescem tags, and immunogenic tags. 

Other assays provided by the invention can be used to determine one or more polypeptide- 
binding characteristics of a probe molecule. Such assays may include preparing a labeled sample of 
the probe molecule (for instance, a nucleic acid molecule, polypeptide or ligand). The probe is 
contacted to an array of target polypeptides to produce a binding panerr,. which can then be detected. 
In certain embodiments, unbound probe is washed or otherwise removed from the anay. for instance 
prior to detecting the binding pattem, to reduce or remove background signals. Target polypeptides 
of the arrays used in these assays are stably associated with a solid support. 

Examples of labels for use with any of the assays of the invention include all labels that can 
be attached to a probe molecule to facilitate detection of the molecule. Such labels include tags that 
can be directly detected (e.g., radioisotopes, fluorcscem or luminescem tags) as well as labels that 
require secondary detection (eg., immunogenic or epitope tags, members of the strept/avidinrbiotin 
system). Probes can also be detectable in the sense that they can be detected based on a characteristic 
inherent in the probe itself (eg., immunogenicity, inherent fluorescence, etc.). 

This invention also provides kits for labeling probe molecules to be used with array-based 
protein interaction assays (e.g., universal protein array based assays). Such kits include at least a tag 
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subsoi,iiallyp„„p,|yptp,ij,„jl^l^ ' 
»««,,„..,.^^,,_^^ 

of » JTT* ' 
^-pp...f^^tap^pH„„^,,^^^,.^^^ 

P.to.p..d= ».pte (Which he »h»„»^|,p^, i^^,„^ » . »,id »pp„, |„ „ 

rrir "•""'^*'«"--"-<-»->--«.^-», 

The fo„g.i„8 »h„ f».„, ,^ „f ^ ^^.^ 1^^^^ 

apparent from the following detailed de«;rrmtmn «r,^ i 

, ^ ^ 'lescnption of several embodiments, which proceeds with 

reference to the accompanying figures. 



wo 00/54046 



PCTAJSOO/06244 



5- 



BRIEF DESCRIPTION OF THE FIGURES 
Figure 1: Protein-Protein Interactions 

The universal protein amy (UPA) provides quantitative detection of specific protein-protein 
interactions at different salt wash stringencies. Fig. 1 A shows the autoradiographic signals detected 
from the herein-described UPA that was incubated with »P-labeled GST-K-p52. then washed with 
low salt buffer AlOO (100 mM KCl) to remove unbound probe, as described in Example 3. Fig. IB 
shows the autoradiographic signals detected from the same UPA after it was washed in high salt 
AlOOO buffer (100 mM KCl). 

Table 1 is the polypeptide target arrangement key for the array shown in Fig. 1 A and IB. 

Fig. IC is a pictorial representation of the relative affinities of the 48 airayed proteins for 
the transcriptional cofactor p52 after the amy was washed with buffer AlOOO (100 mM KCl). The 
units are reading units from a densitometer. 

Figure 2: Protein-DNA. Protein-RNA, and Proteih-Ligand Interactions 

The universal protein array also pennits autoradiographic detection of protein-dsDNA (Fig. 
2A). protein-ssDNA (Fig. 2B). protein-RNA (Fig. 2C), and protein-ligand (Fig. 2D) interactions. 
The same UPA was probed with «P-labeled nucleic acids (Examples 4 and 5) or with '"I-labeled 
ligand (Example 6) as described in the text. Between each application of probe, the UPA was 
stripped and equilibrated in buffer A 1 00, as described in Example 2. 

As for Fig. 1, Table 1 contains the polypeptide target arrangement key for the array shown 

in Figure 2. 

Figure 3: Detection of ASF/SF2-Interacting Proteins Using a UPA 
Sixteen selected proteins/protein fractions were analyzed for interaction with 32P-labeled 
6H(K)ASF/SF2. Fig. 3 A shows the key grid of 1 6 proteins that were arrayed (in a 4 by 4 grid 
fomat). Fig. 3B and Fig. 3C are autoradiographs of the binding patterns on the UPA after washing 
with 1 00 mM KCl or 500 ml KCl, respectively. 

Key to the abbreviations in Fig. 3A: CTD. the C-terminal domain of RNA polymerase 11 
fused to GST; RPB5. RPB6, RPB8. RPBlOa and RPBIOP correspond to individual subunits of RNA 
polymerase II fiised to GST; TBP. TATA-binding protein; f:TFIID. affinity-purified flag-tagged 
TBP-containing TFIID complex from HeLa cells; RXR, retinoid-X receptor; TR, thyroid hormone 
receptor. His-Hl, histone HI; CcHis, co-histones; HMGI. high mobility group protein 1; ASF. 
alternative splicing fector. GST-Nu. GST-nucleolin firsion; GST-K. GST fiised with a synthetic heart 
muscle kinase site. 



wo 00/54046 



PCT/USOO/06244 



DETAILED DESCRIPTION 
Abbreviations and Definitions 



A. 



Abbreviations 



20 



5 ASF: alternative splicing factor 
Co-His: co-histones 

CTD: the C-tenninal domain of RNA polymerase II fused to GST 
^TFIID: flag-tagged TBP-containing TFI.D complex from HeLa cells 

G4.94,G4-147,G4-AH,G4.VP16,G4-CTP r^-c , r^. ^ 

'0 differem,n.„,. / • '''''^'"'^^^''^♦C^'E.C4.Tat: GaUfusedto 

different transcription activation domains (see Table 2) 

GST: Glutathiones-transferase 
GST-K: GST fused with a synthetic heart muscle kinase site 
GST-Nu: GST-nucleolin fusion 
His-Hl: histoneHl 
'5 HMG 1 : high mobility group protein 1 
Oct l: B-cell specific activator 
p52: novel transcription factor p52 
p75: novel transcription factor p75 
P75-C: C-tem,inal region of novel transcription factorp75 
p300-C: transcriptional activator 
PC4: positive cofactor 4 

m7. PC4-wt: various PC4 polypeptides (see Table 2) 
PCAF: a p300/CBP-associated factor that functions as a histone 
PCAF-C: C-tenninal region Of PCAF 
Pol II: polymerase II 

Spl: class II gene activator 

WtR: retinoid-X receptor 
TAF250: transcriptional coactivator 
TBP: TATA-binding protein 

TFIIA, TFIIB, TBP, frTFIID, TFIIE, TFIIF fTFIlH r„„ ,u - 

" HE. IIF. and IIH tnmscnption factors IIA, IIB. IID. 

Topo I (Wt), Topo I (mt), Topo I fwtt* Tnn„ I / . . 

Table2) . ' 'opoisomerase 1 polypeptides (see 



25 



wo 00/54046 



PCT/USOO/06244 



-7- 



10 



TR: thyroid hormone receptor 
UPA: universal protein array 

B; Definitions 

Unless otherwise noted, technical terms are used according to conventional usage 
Definitions of common terms in molecular biology may be found in Benjamin Lewin, Gene, VII, 
publ.shedbyOxfordUniversityPress.2000(lSBN0-19-899276-X);Kendrewer./ (eds) The ' 
Encyclopedia of Molecular Biology, published by Blackwell Science Ltd., .994 (ISBN 0^32-02182- 
9); and Robert A. Meyer, (ed.). Molecular Biology and Biotechnology: a Comprehensive Desk 
Reference, published by VCH Publishers. Inc., 1995 (ISBN 1-56081-569-8). 

in order to facilitate review of the various embodiments of the invention, the following 
definition of terms is provided: 
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Array: An arrangement of molecules, particularly biological macromolecules (such as 
polypeptides or nucleic acids) in addressable locations on a substrate. A "microarray" is an airay that 
.s miniaturized so as to require microscopic examination for evaluation. 

Within an array, each arrayed molecule is addressable, in that its location can be reliably 
and consistently determined within the at least two dimensions of the array surface. TT,us, in ordered 
arrays the location of each molecule sample is assigned to the sample a. the time when it is spotted 
onto the array surface and usually a key is provided in order to correlate each location with the 
appropriate target. Often, ordered arrays are arranged in a symmetrical grid patten,, bu, samples 
could be arranged in other patterns (e.g., in radially distributed lines or ordered clusters). ■ 

The shape of the sample application "spot" is immaterial to the invemion. Thus, thouah the 
.erm "spot" is used throughout this specification, it refers generally to a localized deposit of sariiple 
target polypeptide, and is not limited to a round or substantially round region. For instance 
essentially square regions of polypeptide application can be used with anays of this invention, as can 
be regions tha, are essentially rectangular (such as slot blot application), or triangular, oval or 
■rregular. TTre shape of the array itself is also immaterial to the invention, though it is usually 
substantially flat and may be rectangular or square in general shape. 

A key to one example amiy is shown in Table I . Construction of this array is described in 
Example 1. THis ar«y has 48 addresses (individual spou on the array), which are arranged in an 8 
by 12 grid. With eight columns labeled "a" through "h" and twelve rows labeled "1 " through "P " 
Each address position can be referred to by a row and column label (.,., address "la" in the up'per 
left comer of the array contains transcription fector IIA, abbreviated "TFIIA") 

In this particular example array, as described below in Example 1, each target polypeptide 
has been sponed onto the array twice to provide internal controls. The duplicate samples are found 
.n a pair of horizontally adjacent addresses of the array: for instance, transcription factor IIA (TFIIA) 
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is found at both address la and addres<i i h .1 

. . This pair of addresses (^^^^^^ 

d«UB«l ,„ „m„f «, E«n,pte „ may 1^ 

K „ .^e, i„ . ^ ^ ,^ ^ 500 0, ,000 MM KC) 

P»^»«»^^«,.,„,M,^,^.,^^^,,^^^^^^ ^^^^^^^^^ 

»«».y of*. b»d», 6^ . ^^.^ 

*. f»r b,„s ,.d=.„e, ^ d™,„i.<« . 
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in a polypeptide, or for a stop signal. The term "codon" is also used for the corresponding (and 
complementary) sequences of three nucleotides in the mRNA into which the DNA sequence is 
transcribed. 

High throughput genomics: Application of genomic or genetic data or analysis techniques 
that use microarrays or other genomic technologies to rapidly identify large numbers of genes or 
proteins, or distinguish their structure, expression or function fi-om normal or abnormal cells or 
tissues. 

Isolated: An "isolated" biological component (such as a nucleic acid molecule, protein or 
organelle) has been substantially separated or purified away from other biological components in the 
cell of the organism in which the component naturally occurs, i.e., other chromosomal and extra- 
chromosomal DNA and RNA. proteins and organelles. Nucleic acids and proteins that have been 
"isolated" include nucleic acids and proteins purified by standard purification methods. The tenn 
also embraces nucleic acids and proteins prepared by recombinant expression in a host cell as well as 
chemically synthesized nucleic acids. 

Nucleic acid: A deoxyribonucleotide or ribonucleotide polymer in either single or double 
stranded form, and unless otherwise limited, encompasses known analogues of natural nucleotides 
that hybridize to nucleic acids in a manner similar to naturally occurring nucleotides. 

Oligonucleotide: A linear polynucleotide sequence of up to about 300 nucleotide bases in 
length, for example a polynucleotide (such as DNA or RNA) which is at least 6 nucleotides, for 
example at least 15. 50, 100 or even 200 nucleotides long. 

An oligonucleotide analog refers to moieties that function similarly to oligonucleotides but 
have non-naturally occurring portions. For example, oligonucleotide analogs can contain non- 
naturally occurring portions, such as altered sugar moieties or inter-sugar linkages, such as a 
phosphorothioate oligodeoxynudeotide. Functional analogs of nanitally occurring polynucleotides 
can bind to RNA or DNA. and include peptide nucleic acid (PNA) molecules. Such analog 
molecules may also bind to or interact with polypeptides or pn)teins. - 

Oligopeptide: An oligopeptide is defined as a linear molecule of about 50 or fewer amino 
acid residues. 

Peptide Nucleic Acid (PNA): An oligonucleotide analog with a backbone comprised of 
monomers coupled by amide (peptide) bonds, such as amino acid monomers joined by peptide 
bonds. 

Probe: A molecule that may bind to or interact with one or more polypeptides. A probe, as 
the term is used herein, can be any molecule that is used to challenge ("probe," "assay," "inteiTOgate" 
or "screen") a polypeptide array in order to determine the binding or interaction characteristics of the 
arrayed polypeptides with that probe molecule. In specific embodiments of the current invention, 
probes may be from different and varied molecular classes. Such classes are, for instance, nucleic 
acids (such as single or double stranded DNA or RNA). oligo- or polypeptides (such as proteins. 
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prote.f«g.e„. including domains or s.b-domains. and «^ 

protemsXorvarioustypesofotherpotentialpolype^^^^^^^^^ Such other molecules 

ar. «fened to hen.in generally as ligands (such as drugs, toxins, venoms, hormones, co-factors 

substratesorreactionp«,ductsofenzyma,icreactionsoranalogsthereof,tr.^ 
minerals, and SO forth). 

^"-"y.'-P^bemoleculeisdetectableforuseinprobinganarrayofthein P^bes 
can bedetectableb^ontheir inherent characteristics(e.,.immunogenici.y)orc. 
^'-•^'''^''ybeinglabeledwithanindependentlyde.eaable^^^ 

feature that .s, for example, microscopically distinguishable in shape, size, color.opt^ 

d.fferentlyabsorbingoremittingoflight;chemicallyreactive:magne.icallyorelectronically ' ' 

encoded; or in someother way detectable. Specificexamplesoftagsare fluorescent or 1^^^^^^^^^ 

molecules that areanachedtotheprobe, or radioactivemonomers or molecules that can be added 

dunng oraftersynthesisoftheprobe molecule. Othertags may be immunogenic secuences(such as 
ep.tope tags) or molecules of known binding pairs (such as members of the strept/avidinbiotin 

system). Other tags and detection systemsare known to those of skill in the art, and can be used in 

the present invention. 

Though in many embodiments of the invention a single type of probe molecule (for instance 

one protein)atatimewill be used to assay the array.in some embodiments, mixtures of probeswil, 
be used, for instance mixtures of two proteins or two nucleic acid molecules. Such co-applied probes 
may be labeled with different tags, such that they can be simultaneously detected as different signals 
{e.g., two fluorophors that emit at different wavelengths). 

Probe standart: A probe molecule for use as a control in analyzing an array. Positive 
probe standards include any probes that are known to interact with at least one of the target 
polypepfdes of the array. Negative probe standards include any probes that are known not to 
.nteract with at leastonetargetpolypeptide of thearny. Probe standards that may be used in 
system includemoleculesofthesameclassasthetestprobe that will be used to assay the array For 
mstance, if thearray Will be used to examine the interaction ofaprotein With the polypeptidesofthe 

array.theprobestandardcanbeaproteinoroligo-orpolypeptide. However, this will not always be 
Mhecase. 

In some instances, as in certain of the kits that are subjects of this invention, a probe 
standard will be supplied that isunlabeled. such unlabeledprobestandardscan be used inala^^^ 

r^actionasastandardforcomparinglabelingefficiencyofthetestprobethatisbeingstudiedln " 
some embodiments, labeled probe standards will be provided in the kits. 

Probing: As used herein, the term "probing" refers to incubating an array with a probe 

molecule(usually in solution)inordertodetennine whether the probe molecule willbind to or 
.nteract with molecules immobilized on the array. Synonyms include «interrogating." "challengino " 
screenmg" and "assaying" an array. Thus, a universal protein array of the invention is said to be' 
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"probed" or "assayed" or "challenged" when it is incubated with a probe molecule (such as a 
polypeptide, nucleic acid molecule, or ligand). 

Protein/Polypeptlde: A biological molecule expressed by a gene or other encoding nucleic 
acid, and comprised of amino acids. More generally, a polypeptide is any linear chain of amino 
acids, usually about 50 or more amino acid residues in length. 

Arrays according to the present invemion include a plurality of polypeptide samples 
(targets) "spotted" at assignable locations on the surface of an array substrate. Tlie polypeptide at 
each spot can be referred to as a target polypeptide, or target polypeptide sample. In certain 
embodiments, polypeptides are deposited on and bound to the array surface in a substantially native 
connguration. such that at least a portion of the individual polypeptides within the spot are in a native 
configuration. Such native configuration polypeptides are capable of binding to or interacting with 
molecules in solution that are applied to the surface of the array in a manner that approximates 
natural intra- or intermolecular interactions. Thus, binding of a molecule in solution (for instance, a 
probe) to a target polypeptide immobilized on an array will be indicative of the likelihood of such 
interactions in the natural situation {i.e., within a cell). 

In ceitain arrays of the invention, referred to as pooled arrays, at least one panicular address 
on the anray is occupied by a pooled mixture of more than one substantially pure target polypeptide. 
All of the addresses on the array may contains pools of polypeptide, or only some of the addresses, 
depending on the use of the array. For instance, in some circumstances it may be desirable to array a 
target polypeptide associated with one or more non-target polypeptides, for instance a stabilizing 
polypeptide or linker molecule. In addition, the native conformation of certain binding sites on 
proteins can only be assayed for probe binding when the target polypeptide is associated with other 
molecules, for instance when the target polypeptide natively exists as one subunit of a multimeric 
complex. Pooled arrays of the current invention include those on which one or more of the addresses 
contains a multimeric polypeptide complex. In the case of such an array, it is envisioned that 
different probe molecules may bind to different polypeptides within the complex of "target" 
polypeptides. 

Although the identity of each probe in the pooled mixture at a specific address is known, the 
individual probes in the pool are not "separately addressable." The binding signal from a pooled 
address is the binding signal of the set of different (but mixed or associated) polypeptides occupying 
that address. In general, an address is considered to display binding of a probe molecule if at least " 
one polypeptide occupying the address binds to the probe molecule. 

Arraying pooled samples is also a powerftil tool in high-throughput technologies for 
increasing the information that is yielded each time the array is assayed. 

Protein purification: Polypeptides for use in the present invention can be purified by any 
of the means known in the art. See. e.g.. Guide to Protein Purification, ed. Deutscher, Meth. 
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In variations of the UFA technology, the detectable probe is a specific protein, polypeptide 
.ngle- or do«bIe-stn«ded nucleic acid, ligand or other natural or synthetic .olecule. depending on ' 
the .nteraction(s) being tested for. Such detectable molecules are used to detect and/or quantitate 
interaction with the polypeptides of the UFA. 

5 

Unless othenvise defined, all technical and scientific tenns used herein have the same 
meaning as commonly understood by one of ordinary skill in the art to which this invention belon-s 
Although methods and materials similar or equivalent to those described herein can be used in the' 
practice or testing of the presem invention, suitable methods and materials are described below ,„ 
case of conflict, the presem specification, including definitions, will control. In addition the 
materials, methods, and examples are illustrative only and are not intended to be limiting. 

n. Universal Protein Arrays 

•5 Arrays of the current invention provide several advantages over prior technologies and 

methods used for analysis of protein-molecule interactions. Although dot blot analysis with 
unpunfied protein p«parations has been used for the detection of specific antibody-antigen 
interactions, use of highly purified and active ,«:ombinam or native target proteins, in an array 
format, to assay for interactions with a specific probe has no. previously been reported. Additionally 
because the UFA assay can in some embodiments be carried out under non-denaturing conditions it ' 
provides a simple system for detecting native interactions between polypeptides and probe 
molecules. 

Known techniques fall far short of the UFA invention disclosed herein. In the case of far- 
' western bio. analysis, protein fractions were usually analyzed by SDS-FAGE and electrotransfeired 
.0 a membrane, followed by dena.uration and renaturation before probing with a radiolabeled protein 
probe. On average, only 1-^% of the activity (without considering the loss of protein during the 
transfer process) could be recovered for most p^teins with such a p««edure (Ge ./., Mol Cell 2 
75 1 -759. 1998). In contrast. UFA analysis as described herein simply uses active proteins directly 
spotted onto a substrate, such as a membrane. Tl-erefore. it is at least 10- to 100-fold more sensitive 
jO than the far-western blot assay. 

Since the amount of active protein assayed for interaction with the pn,be on a UFA can be 
the amount of protein applied, the affinities of individual proteins for a specific probe molecule 
either a protein or another type of biomolecule or other ligand. can be easily quantified and compared 
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with each other. 



Most existing assay systems were designed for a single purpose (to be probed with a single 
type of molecule). For example, the two hybrid system, co-immunoprecipiution, far-western 
blomng and GST assays were all used only for protein-protein interaction: the gel mobility shift 



PCT/USOO/06244 



•14- 



fMpiinling and cross-lmkiiig assay, »-M,rf,„. .. 
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"""Pies given are meant .n no way to limit the invention. 
~ Choice of polyppptM.^.,r^rt7 

ASC.,«nte,„„s»«s,.ssis<daus»,g,™,j,,„,^^,^ 

Any polypepLdes can serve as tMeB for «.!.«,. ^ ^' 
"UW be assonblKl to ^ f "»««.«, a™, 

-p*™^p.*..,,p™..s.no™itc::^""''-''™"'^^^ 
---».^j»p.^:::ir;::ra::"T'-" 

«y -ay ofexample. there are approximately 100,000 different eenes in th.h 
and « is expected that all ofthem will be known . ^^e human genome, 

every gene in the human .eno " '"^ P-'^-" 

.Hehumangenome.eve^p.o.ei„encodedror.yeachhuma„genecanbearrayedo„ 
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m a ch,»,os<™., „d, to a eolte^. „f23 UPA, w»kl »»„,p,ss «^ 

human genome. 

5 Genome-wide or chromosome-specific polypeptide arrays or array sets are no. limited to the 

human genome. Any species for which the genome is known or becomes known could be arrayed on 
one or a collection of a.«ys according to this invention. Such non-human genomes include those 
from dtsease organisms (eg., vinises. bacteria, parasites, etc.). research organisms (DrosopMa 
n.e.ar,osas,er, CaenorHatdUis e,e,ans, Xenopus ,ae.is, Ara.idopsis, SaccHaron,yces cer^^^ 
10 £sc/ier/c>i/fl CO//, etc.), and so forth. 

As demonstrated below (Example 3). UPA is an effective method to map protein interaction 
domatns and DNA- or RNA-binding domains of a protein. In certain UPAs of this invemion 
therefore, the target polypeptides a« collections of closely related sequences, for instance a series of 
nested polypeptide deletions of varying length or a series of polypeptides with different amino acid 
res.dues a, single sites throughout the sequence. Another alternative is a collection of different 
domain fragments of one protein or a family of closely related proteins; the domains may be fused to 
. another (non-target) protein. Such domain or mutation arrays can be used to determine which amino 
acd restdues or domains are important in known or suspected binding interactions between the base 
target protein and the probe or probes used to assay the array. 
20 Applications of the universal protein array technology are not limited to studies of 

transcriptional factors, although the following Examples 1-6 disclose embodiments of its use in 
connection with analysis of such facto.. UPA analysis could also be instrumental in understanding 
polypeptide binding characteristics of multiple protein profiles expressed during various disease 
states or^growth conditions, as well as in normal human or animal protein profiles, including profiles 
^3 from different transgenic animals or cultured cells. 

Polypeptide an,ys according to this invemion may also be used to perform further analysis 
on genes and targets discovered from, for example, high-throughput genomics, such as DNA 
sequencmg. DNA microanays. or SAGE (Serial Analysis of Gene Expression) (Velculescu . al 
Sciencer„.AUA^l, ,995). Polypeptide arrays according to this invention may also be used to' 
30 evaluate reagents for disease or cancer diagnostics, for instance specific antibodies or probes that 

react w,th certain polypeptides from infectious organisms or from tissues at different stages of cancer 
development. This technology can also be used to follow progression of polypeptide changes both in 

the same and in differemcancertypes, or in diseases other than cancer. Polypeptide an^^^ 
to th,s mvention may be used t identify and analyze prognostic markers or markers that predict 
t erapy outcome for various diseases or abnormal conditions, such as cancers. Arrays compiled from 
the pro.e,ns of hundreds of cancers derived from patients with known disease outcomes permit 
b.ndmg or association assays to be performed on those arrays, to determine imponant prognostic 
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markers, or markers predicting therapy outcome, which are associated with polypeptide binding 

characteristics. 

Polypeptide airays according to this invention may also be used to help assess the ability of 
certain dr.gs or potential drugs to interact with target polypeptides, or the ability of such molecules 
to block the interaction of other probes with arrayed polypeptides. 

The UPAS of this invention can be used to investigate receptor specificity of different types 
of known and suspected receptor molecules. Examples of receptors that can be investigated for 
probe-specific binding by arrays according to this invention include but are not limited to 
microorganism receptors (for instance, those found in fungi, protozoa, and bacteria, especially 
bacterial strains that are resistant to antibiotics); hormone receptors (including those involved in 
diabetes, growth regulation, vasoregulation, and so forth); and opiate receptors (involved in 
biological responses, for instance to addictive drugs). 

Also envisioned are arrays that are custom produced for the researcher, with an arrayed 
collection of polypeptides tailored to a specific research project, research system, etc. 

Not in any way intending to be limited to the list below, the following is a list of the types of 
collections of polypeptides that can be arrayed on a UFA according to this invention: ail or 
substantially all the proteins encoded for by the genome of an organism; all or substantially all the 
proteins encoded for by a chromosome of an organism; proteins expressed in a cell during a 
particular growth phase or environmental condition; proteins expressed in a cell under a particular 
abnormal state (such as cancer, disease, or infection); proteins expressed in cells at various times 
during the progression of a disease or condition (e.g., during progression of a tumor, or developmem 
of a chronic disease such as Alheizmers); proteins expressed in a particular cell type; proteins from a 
particular protein family (e.g„ DNA polymerases, cell surface proteins, transmembrane proteins or 
fragments [such as soluble fragments] thereof, oncogene proteins, mmor suppressor proteins, and so 
forth); proteins that show sequence homology to each other; proteins thai share secondary structural 
characteristics: proteins that associate to form multimeric complexes (..g.. the subunits of a ribosome 
or a membrane ATPase); viral epitopes; domains of proteins; proteins from differem species; and 
collections of fragments of any of these protein collections. 

1: Production of substantially pure target polypeptides 

Polypeptides for use as targets on the subject arrays can be produced by any technique that 
yields native protein. These techniques in general include expression from engineered DNA 
constructs, extraaion from native samples (eg. . clinical samples), or de novo synthesis of 
oligopeptide or polypeptide fragments. 

Expression of the target polypeptides can be carried out using well known techniques. For 
instance, panial or full-length cDNA sequences, which encode the protein of interest as a target on 
the UFA. may be ligated into bacterial expression vectors. Methods for expressing large amounts of 
protein from a cloned gene introduced into Escherichia coli (E. coif) may be utilized for the 
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production and purification of intact, native target proteins. Methods and plasmid vectors for 
producing fusion proteins and intact native proteins in bacteria are described in Sambrook et ai 
(Sambrook et ai, In Molecular Cloning: A Laboratory Manual, Ch. 17, CSHL, New York, 1989). 
Such fusion proteins may be made in large amounts and are easy to purify. Native proteins can be 
5 produced in bacteria by placing a strong, regulated promoter and an efficient ribosome-binding site 
upstream of the cloned gene. If low levels of protein are produced, additional steps may be taken to 
increase protein production; if high levels of protein are produced, purification is relatively easy. 
Suitable methods are presented in Sambrook et ai (In Molecular Cloning: A Laboratory Manual, 
CSHL, New York, 1989) and are well known in the ait. Often, proteins expressed at high levels are 
1 0 found in insoluble inclusion bodies. Methods for extracting proteins from these aggregates are 
described by Sambrook et ai (In Molecular Cloning: A Laboratory Manual, Ch. 1 7, CSHL, New 
York, 1989), Vector systems suitable for the expression of lacZ fusion genes include the pUR series 
of vectors (Ruther and Muller-Hill, EMBOJ. 2:1791, 1983), pEXl-3 (Stanley and Luzio. EMBOJ. 
3: 1429, 1984) and pMRlOO (Gray et ai. Proc, Natl. Acad Sci. USA 79:6598, 1982). Vectors 
1 5 suitable for the production of intact native proteins include pKC30 (Shimatake and Rosenberg. 

-Nature 292: 128, 1981 ), pKKl 77-3 (Amann and Brosius, Gene 40: 1 83, 1985) and pET-3 (Studiar and 
. Moffan, 7. MoL Biol. 1 89: 1 1 3, 1 986). 

C. Choice of array format and structure 

UPAs may vary significantly in their structure, composition, and intended functionality. 
20 The UPA system is amenable to use in either a macroarray or a microarray format, or a combination 
thereof. Such arrays can include, for example, at least 50, 100, 150, 200, 500, 1000, or 5000 or more 
array elements (such as spots). In the case of macro-UPAs, no additional sophisticated equipment is 
usually required to detect the bound probe on the UPA, though quantification may be assisted by 
known automated scanning and/or quantification techniques and equipment. Thus, macro-UPA 
25 analysis can be carried out in most research laboratories and biotechnology companies, without the 
need for investment in specialized and expensive reading equipment. 

Examples of substrates for UPAs include glass (eg., functionalized glass). Si, Ge, GaAs, 
GaP, SiO,, SiN4, modified silicon nitrocellulose, polyvinylidene fluoride, polystyrene, 
polytetrafluoroethylene, polycarbonate, nylon, fiber, or combinations thereof. Array subsn-ates can 
30 be stiff and relatively inflexible (eg., glass or a supported membrane) or flexible (such as a polymer 
membrane). One commercially available microarray system that can be used with the arrays of this 
invention is the FAST™ slides system (Schleicher & Schuell, DasseK Germany), which incorporates 
a patch of polymer on the surface of a glass slide. 

In general, a target on the array should be discrete, in that signals from that target can be 
35 distinguished from signals of neighboring targets, either by the naked eye (macroarrays) or by 

scanning or reading by a piece of equipment or with the assistance of a microscope (microarrays). 

Macro-UPAs are often arrayed on polymer membranes, either supported or not, and can be 
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individual dots. 

^''''^'^^-"'"^'^'"J-^ofasize.argeenoughtopennittheirdete^^^^ 
. ass.sta„ce of a microscope or other sophisticated enlargement equipment. Thus, spots may be as 

-al.asabout0.1mmac.ss.withaseparationofabout the same distance.a„d can be larger 

Largersamplespotsonmacroanays.forexamp,,maybeabout0.5J.2.3.5.7.orl^ 

Even larger spots may be larger than ,0 mm (. cm) across, in certain specific embodiments. TTe 

array s.ewi„in genera, beco^latedthesi^ofthesamplespouappliedtothearra, in that .^^^^ 

spots w. usually be found on .argerat„ys.whilesmal.erspotsmay be foundonsmallera.^^ 
This correlation is not necessaiy to the invention, though. 

'";--y"PAs.acommonfea.u«isthesmal.sizeofthetargetan.y.forexampleon 
.he order ofasquared centimeter or less. A squa^d centimeter (1 cm by 1 cm) is large enough to 
contam over2.500 individual target spots, if each spot has a diameter of 0,1 mm and spots are 
sepanued by 0.1 mm from each other. A two-fo.d reduction in spot diameterand separation can 

aowor,0.000 such spotsinthesamearray.andan additional halving ofthesedimensions would 
a iowfor40,000spo.s. Usingmicrofabrication technologies, such as photolithography.pionee^ by 
the computer i„dusto..spo.sizesoflessthan0.01 mm are feasible, potentially providing forovera 
quarterofamilliondifferenttargetsites. T^-e power ofmicroa.ay-fonna.UPAs resides not only in 
.he number of different polypeptides that can be probed simultaneously, but also in how linle protein 
is need for the target. 

The amouM of polypeptide target sample that is applied to each address of an array will be 
largely dependent on the anay format used. For Instance, microarrays will gene.lly have less 
polypeptide applied at each address than will mac^arrays. By way of example, individual targets on 
a macroatray can be applied in the amount of about 1 pmol or greater, for instance about 3 pmol 
about 5 pmol. about 7.5 pmol. about 10 pmol, about 1 5 pmol or more. In contrast, samples applied 

to md,vidua. spots onamicroarny will usually be less than, pmol in each spot, for insta^^^^ 
.8 pmol. about 0.5 pmol. about OJ pmol. about 0. 1 pmol. about .05 pmol or less. 

In addition, the surfece area of sample application for each "spot" will influence how much 

Poypeptideisimmobilizedonthea^ysurface. Thus, a larger spot (having a grater surface area) 

wnigenerally accept or requireag^ter amount of target molecule thanasmaller sample spot 
(having a smaller surface area). 
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The target polypeptide itself (e.g.. the length of the polypeptide, its primary and secondary 
st™cture.«s binding chan.cteristicsi« relation to theatra^ 
eachtargetpo.ypeptideisapp,iedtoanarny.Optin.a.a.ountsoftarget.olecu^^ 

--^^^'''-nventioncanbeeasi.ydetennined.forins.ancebyapplyingvaryinga.ountsofthe 
targe, polypeptide to an array surface and probing the a^y with a probe .olecule known to interact 
w.th that target. In this manner, it is possible for one of oniinaty skill in the art to empirically 
determine of range of target molecule amounts that produce inteqjretable results 

Another way to describe an an^y is its density - the number of samples in a certain 
specfled surface area. Pormacroarrays of the currem invention, anay density will usually be 
between about one target per squared decimeter (or one target address in a 10 cm by .0 cm region of 

the array substrate)to about 50 targets per squared centimeter(50 targets withina 1 cm by I cm 
reg,on of the substrate). For microarrays. a-ray density will usually be one target per squared 
centimeter or more, for instance about 50, about .00. about 200. about 300. about 400. about 500 
about 1000. about .500, about 2,500. about 5.000, about 10.000, about 50.000. about .00.000 or ' 
1 5 more targets per squared centimeter. 

^ Application of targets to array c: 

Targets on the array may be made of oligopeptides, polypeptides, proteins, or fragments of 
these molecules. Oligopeptides, containing between about 8 and about 50 linked amino acids, can be 
synthesized readily by chemical methods. Photolithog^phic techniques allow the synthesis of 
.0 hundreds of thousands of different types of oligopeptides to be separated into individual spots on a 
single chip, in a process referred to as in sUu synthesis, as has been done with oligonucleotide arrays 

Longer polypeptides or proteins, on the other hand, contain up to several thousand amino 
acd res>dues. and are no, as easily synthesized through in .,ro chemical methods. Instead 
poiypeptides and proteins for use in UPAs are usually expressed using one of several well known 
cellular expression systems, including those described above. Alternatively, proteins can be isolated 
from their native environment, for instance from tissue samples or environmental samples, or from 
expression chambers in the case of engineered expressed polypeptides. After extniction and 
appropriate purification, the polypeptide can be deposited onto the array using any of a variety of 
techniques. 

> m the methods disclosed in this applications, target polypeptides can be delivered to the 

substrate of the array by various different mechanisms. One is by flowing within a chamiel defined 
on predefined regions of the anay substrate. Typical "flow channel" application methods for 
applying the polypeptides to arrays of the present invention are represented by dot-blot or slot-blot 
systems (see. e,.. U.S. Patents No. 4.427,4,5 and 5.283,039). One alternative method f r applyin- 
the targets to the array substrate is "spotting" the target polypeptide on predefined regions (each 
corresponding to an airay address). In a spotting technique, the target molecules are delivered by 
directly depositing (rather than flowing) relatively small quamities of them in selected regions For 



wo 00/54046 



PCT/USOO/06244 



■20- 



20 



25 



30 



simultaneously. 

"^^"y.^^targetpolypeptidesaredepositedonthearn^ 
arc substantially .reve„ib.yb6u„d.othea„ay.Forexa.ple.atarget .ay 

.o«than30%ofthepolypeptideo„thearnyattheendofthebindin8processcanbewashed^ 

us.ngbuffersoftheUPAsysten,(e,.lo.brhighsaltbuffe„orstrippingbuffers). mother 
embodiments, no more than.m. no mote thanm.non,oreth^ 

than5/,or„o.ore.ha„3o/oofthepolypeptideonthea,raya.thee„dofthebindi^^ 
washed off using buffers of the UPA system. 

Depending on the atray substrate used, the substrate alone may substantially irreversibly 
md the target without further linking being necessary (e,.. nitrocellulose and PVDF membranes) 
in other instance, a linking or binding pn«:ess must be perfom-ed to ensure binding of the 
polypeptides. Examples of linking processes are known to those of skill in the art. as are the 
substrates that require such a linking process in order to bind polypeptide molecules. TT,e target 
polypeptides optionally may be attached to the array substrate through linker molecules. 

m certain embodiments, the non-sample regions of the array surface (those regions of the 

array surface that donot contain target molecules)are blocked in order to prevent or inhibit bind^ 
of the probe molecules directly to the array surface. 

I« « beneficial in certain embodiments to apply a known amount of each target polypeptide 
on the .my. m particular embodiments, an essentially equal amount of each target polypeptide is 
apphed to each spot. QuantifK^tion and equivalent application of the targets penniu comparison of 
probe bmding affinity between the different tan^ets. Measut^ments of the amount of specific target 
protems may be carried out dm,ugh many techniques well knoM^i in the an. These include 
quanttutive immunoblot analysis, enzyme activity assays (where appropriate), and commercially 
available protein quantification kits (.g.. Bio-Rad protein assay systems), 
concentration of protein in a sample regardless of biological characteristics of the specific protein 
bemg measured. 

Many other techniques could be used to measure the amount of a target protein present in a 
sample, For mstance. the amount of target protein in a sample could be measured using a 

quant,tat,ve enzyme-linked immunosorbant assay CELISA') as described by Aboagye-Mathiesener 
a/ (P/ace«/a 18:155-61, 1997). 

In certain arrays of the invention, referred to as pooled array, at leas, one particular address 
on the array .s occupied by a pooled mixture of more than one substantially pure target polypeptide. 
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An Of he addresses on the a^y.ay contains poolsofpoly^^^^^^^ 

epe„d.„go„*euseofthea^y.PoMnstanceJ„son,ec.cun,stancesit.aybed.^^^^^ 
.aiiet polypeptide associated with one or .ore non-target polypeptides, for instance a stabilizing 
polypeptide or linker molecule. ,„ addition, the native conformation of cenain b.nding sites on 
prote.ns can only be assayed for probe binding when the target polypeptide is associated with other 
molecules, for instance when the target polypeptide natively exists as one subunit of a multimeric 

complex. Pooled atrays of the current inventioninclude those on which one or .ore ofthe addresses 
contatns a multimeric polypeptide complex. In the case of such an array, it is envisioned that 
d.fferem probe molecules may bind to different polypeptides within the complex of "target" 
polypeptides. 

Although the identity of each probe in the pooled mixture at a specific address is known, the 
.nd.v.dua. probes in the pool are not "separately addressable." binding signal from a pooled 

addressis the binding Signal of the set of differen.(butmixed or associated) polypeptides occupy^ 
that ad .ess. In general, an address is considered to display binding of a probe molecule if at least 
one polypeptide occupying the address binds to the probe molecule. 

Arraying pooled samples is also a powerful tool in high-throughput technologies for 
.ncreas.ng the infonnation that is yielded each time the a^ay is assayed. Methods for analyzing 
signals from arrays containing pooled samples have been described, for instance in U.S. Patent No 
5,744,305, mcorporated herein by reference in its entirety. 

E- C hoice of probe molecule( s'> 

Any molecule that might bind to or interact with one or more polypeptides can be used as a 
probe w.th the disclosed ar^ys. ,„ specific embodiments of the current invention, probes may be 
from dtfferem molecular classes (e.^.. nucleic acids, oligo- or polypeptides, or various types of 
l-gands). Probes (especially those that are polymeric cha.ns) may be of various lengths, and different 
results may be obtained from the same array by using related probe molecules of different length 
L.kew.se. varying the sequence of polymeric chain probes may provide valuable binding data" 

Though in many embodiments of the invention a single type of probe molecule (for instance 
one protem) at a time wil, be used to assay the array, in some embodiments, mixtures of probes will 
be used simultaneously, for instance mixtu«s of two proteins or two nucleic acid molecules 
S.mu,taneousmultiple-probi„g(e,. double-probing) can be used to d«^^^ 

or bmd.ngsystemsthat«qulrc the interaction Of mo^moleculesthanjust one polype^^^^^^ 
one probe molecule. * 

— Labeling and d etection of pm he molecule( s^ 

Usually, probe molecules used to assay the disclosed UPAs are detectable. Probes can be 
detectable based on their inherent characteristics (..,.. immunogenicit>.) or can be rendered 
detectable by being labeled with an independently detectable tag. Such tags include fiuorescem or 
lummescen, molecules that are anached to the probe, or radioactive monomers or molecules that can 
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and can be used in the present invention. 

labeling different probes with diffe^nt tags to enable simultaneous detection ofbindino 
two br more probes on the polypeptides of an an,y. Multiple-label challenges to an array of this" 

■nvention can alsobeusedtoexamineany competitive bindingbetweenthetwoar^^^ 

poypept.desofthea.,Forcompetitivebin^ 

be detectable. 

Computer assisted (automated^ det^rinn ^^^i^^j, ^.f UPAc 

'^^'^^'^g^n^^tedbyassayingauniversalproteinarniyaccordingtothisinventionc^ 
analyzed using Icnown computerized systems. For instance, the am.y can be read by a computerized 

reader" or scanner and the quantification ofthebindingofprobe to individual addresses on the array 
carr,edou,usingcomputeralgori.hms. Such analysis oftheanay can be «fe,red to as "automated 
detecfon" in that the data is being gathered by an automated reader system. 

^'^'''els.ha.emi.detectableelectromagneticwaveorpanicles.theemittediigh. 
ie.S. fluorescence or luminescence) or radioactivity can be detected by very sensitive cameras 
confocal scanners, image analysis devices, radioactive film or a Phosphoimager. which capture' the 

s.gnaIs(suchasacolorimage)from the array. Acomputerwith image analysis software detects this 
.mage, and analyzes the intensity of the signal for each probe location in the array. Signals can be 
compared between spots on a single anay. or between arrays (such as a single array that is 
sequentially probed with multiple different probe molecules). 

Computer algorithms can also be used for comparison between spots on a single array or on 
multrple arrays. In addition, the data from an array can be stored in a computer readable form 

Certam examples of automated array readers (scanners) will be controlled by a computer 
and software programmed to direct the individual components of the reader (e.g.. mechanical 
components such as motors, analysis components such as signal interpretation and background 
subtraction). Optionally software may also be provided reader to control a graphic user interface and 
one or more systems for sorting, categorizing, storing, analyzing, or otherwise processing the dau 
output of the reader. 

To "read- an array according to this invention, an amty that has been assayed with a 

detectableprobe to produce binding(e,.,abindingpattem)canbeplacedinto{oronto. or below 

etc.. depending on the location ofthe detector system)the reader andad^ectable signal indicativeof 
probe binding detected by the reader. TT,ose addresses at which the probe has bound to immobilized 
polypeptide sample provide a detectable signal. e.g., in the form of electromagnetic radiation TT.ese 
detectable signals could be associated with an address identifier signal, identifying the site of the 
complex. The reader gathers information from each of the addresses, associates it with the address 



wo 00/54046 



PCTAJSOO/06244 



-23- 



10 



15 



20 



25 



.denffier signal, and recognizes addresses >vith a detectable signal as distinct from those not 
producing such a signal. The reader is also capable of detecting intermediate levels of signal 
between no signal at all and a high signal, such tha, quantification of signals at individual addresses 



is enabled. 



Cenam readers that can be used to collect data from the arrays of this invention, especially 
those that have been probed using a fluorescently tagged molecule, will include a light source for 
opfcal radiation emission. The wavelength of the excitation light will usually be in the UV or visible 
range, but .n some situations may be extended into the infta-red range. A beam splitter can direct the 
reader-emitted excitation beam into the object lens, which for instance may be mounted such that it 
can move m the x, y and z dit^ctions in relation to the surface of the array substrate. H.e objective 
lens focuses the exciution light onto the array, and more particularly onto the (polypeptide) targets 
on the array. Light at longer wavelengths than the excitation light is emitted from addresses on the 
array that contain fluorescently-labeled probe molecules (i.e.. those addresses containing a 
polypeptide to which the probe binds). 

In certain embodiments of the invention, the array may be movably disposed within the 
reader as it is being read, such that the array itself moves (for instance, rotates) while the reader 
detects information from each address. Alternatively, the array may be stationary within the reader 
wh.le the reader detection system moves across or above or around the array to detect information 
from the addresses of the array. Specific movable-fonnat array readers are known and described for 
mstance in U.S. Patent No. 5. 922.61 7, hereby incoq^orated in its entirety by reference. Examples of 
methods for generating optical data storage focusing and tracking signals are also known (see for 
example. U.S. Pat. No. 5,461.599. hereby incorporated in its entirety by reference). 

For the electronics and computer control, a detector (e.g. . a photomuhiplier tube, avalanche 
detector. Si diode, or other detector having a high quantum efficiency and low noise) converts the 
optical radiation into an electronic signal. An op-amp first amplifies the detected signal and then an 
analog-to-d.giul converter digitizes the signal into binary numbers, which are then collected by a 
computer. 



^ HI. Examples 

Example 1 : Preparation of a UPA 

Methods and Materiak 

To identify target proteins to which the transcriptional coactivator p52 will bind, the protein 
array system for which a target arrangement key is shown in Table 1 was provided. TTie general 
transcription factors, activators and coactivators arrayed were overexpressed either in bacteria 
baculovTus or in mammalian cells and purified to near homogeneity as previouslv described (Chiano 
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«..MO.,,2,2,4,.2,„.,„;K=„h„.„^„..<^.aea,,U««.«4S,,„„.U, 
G.«^.«.,/»A£^U74, 57.,,, Th,.,™...^i„i„,s„|p^^,„„^^ ' 

«7. m). OST.„.M„ «o, p,«|„ ,asT.N„, .adress ,2tf 0 was p.p«d 

P<»».O»»0.707.i„te„™,^prti«,.„„«,;„^S,^,^^„,.„„ <;H.^i,„j 

W.»s.n«d„.HMK*(mSV),OST-K„0„,^,«„.C.«,„,.,„.,„„«^ 
as a negative control in the experiments. 

^"'^^"^""•^P"<>'("o™alized by Bio-Rad protein assay, Bio-Rad, Hercules. CA) 
each of the 48 highly purified proteins (or fh«.ions) was spotted on a 12 x 8 c. nitrocellulose 
me. rane using a 96.wen dot blot appan.tus (Bio-Rad. Hercules. CA). This apparatus provides 

sample appiicaiontoa^embn^etofonn an atray arranged in twelve n,wsandeig^^ 
a.rangen,en. of .he polypeptide targets in the array is shown in Table I . which corresponds to the 

array results Shown in Figures, andl Each sample wasduplica^ 
a useful intemal control. 

HEPES Na pH 7.9. 0.2 n,M EDTA. 10 n,M 2.me.aptoethanol and 0.5 n,M PMSF) and duplicated in 
two adjacent wells. Each well was rinsed with 2 x 500 ^ buffer A .00 and the vacuum kept for 3-5 
m.nute. After removal from the dot blot apparatus, the protein ar^y was Hnsedw^ 
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(I) 

TFIIA [ TFIIA 
(5) 

TFIIE I TFIIE 



(9) 

RXR I RXR 



_c I d 
(2) 

TFIIB I TFIIB 
(6) 

TFIIF I TFIIF 



(13) 

G4-94 I G4-94 

(17) 

G4-CTF I G4-CTF 



(21) 

G4-Tat I G4-Tat 



(iO) 
JR I TR 
(14) 



e I f 

(3) 

TBP J TBP 

(7) 

_ f:TFIIH I f:TFIlH 



04-147 I G 4.147 
(18) 

G4-Spl I G4.SDI 



(22) 

PC4-PJ _PC4-P 
(26) 



(II) 

Oct I I Oct I 



(15) 

G4-AH I G 4-AH 
(19) 

G4-E1A I G4-EIA 



(23) 

PC4-N I PC4-N 



g |_ h 

(4) 

f:TFlID I f :TFIID 
(8) 

Pol II I Pol II 



(12) 
_Spl I Spl 



(16) 

G4-VPI6 I G 4.VP16 
(20) 

G4-1E I G4-IE 



(24) 

PC4-C I PC4-C 




• Abbrcvianon. used .„ I able I are explain«J above, in tf,e 



Abbreviations section (lA). 



Each sample was duplicated in two adjacent wells. 



8 cm (height X width) with eight columns and 



The actual size of the membrane is 12 X 



twelve rows. 



Example 2: Removal of probe molecules from 
the UPA. 



Methods and Materials 
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Examples: Interaction wi.h a protein pr be. 
Methods and Mat^^l^ 

Result 

2^ ^^^"'^■P^2*^ labeled «v//ro with fvMpiATDw „ 
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topo I, either recon,bi„a„t proteins expressed in baculovirus (Fig. 1 , addresses I la-Q or naturally 
purified protein fron, mammalian cells (addresses I Ig/h). This observation demonstrates that p52 
mteracts with the nucleolin rather than the topo 1 present in the SR protein fraction, which is 
consistent with the recent observation that nucleolin is a component of the muhiprotein complex 
associated with p52 in HeLa cells. 

Nucleolin has been implicated in regulating pre-rRNA processing (Bouvei e, al., EMBO J. 
16.5235-5246. 1 997). p,e-mRNA splicing (Ishikawacr a/.. A^o/. Cell. Bid, I3.430M310 1993) B 
cell-specific transcription (Hanakahi et al.. Proc. Natl. Acad ScL USA, 94. 3605-36 10 1997) 
unwinding DNA. RNA or DNA-RNA duplexes (Tuteja e, al.. Gem, 28, 143-148, 1995) and ' 
mediating cell doubling time in human cancer cells (Derenzini et al.,.Lab. Imest., 73. 497-502 
1995). Like the splicing factor ASF/SF2. nucleolin also contains RNP type RNA-binding domains as 
well as RGG repeats (Bouvet etal., EMBO J., 16. 5235-5246, 1997; Valdez et al., Mol. ln,munol., 
32, 1207-1213, 1995). Its activity can be modulated through mitosis-specific phosphorylation by 
P34cdc2 kinase or casein kinase 11 (Tuteja et al.. Gene, 28. 143-148, 1995). Therefore, it would be 
mteresting to further examine the biological significance of nucleolin interaction with the general 
transcriptional coactivator and splicing regulator p52. ' 

In addition to GST-K-p52, other protein probes have also been tested in the UFA system. 
Figure 3 shows the binding activity of «P-iabeled splicing factor ASF/SF2, a member of the SR 
protein family, to 16 different selected proteins in a 4 by 4 array (Fig, 3A). ASF/SF2 significantly 
bound to five of the 16 proteins, including the affinity-purified lFlID complex (Fig. 38 and C 
address 2d). retinoid-X receptor (address 3a), histone H 1 (address 3c). co-histones (address 3dJ and 
ASF/SF2 itself (address 4b). However, after washing the UPA with 500 mM KCI, ASF/SF2 
appeared to have the highest affinity for itself (Fig. 3C. address 4b), which is in agreement with the 
previous observation that in v,7ro translated ASF/SF2 could strongly bind to GST-ASF/ SF2 in a GST 
pull down assay (Xiao and Manley. EMBO J., 17. 6359-6367. 1998). However, ASF/SF2 also 
showed high affinity for the TFIID complex. Since ASF/SF2 did not interact with TBP (address 2c). 
ASF/SF2 might interact directly with TBP-associated factors. Whether such an interaction reflects 
the fiinction of TFIID or ASF/SF2 in transcription or pre-mRNA splicing or coupling of these could 
also to be investigated using the disclosed UPA technology. Taken together, these experiments 
demonstrate that UPA can be used to detect protein interactions with various targets. 

Using the same UPA. it is shown that PC4 with a single point mutation (Phe-Pro) at 
position 77 lost both dsDNA- and ssDNA-binding activity (Fig. 2A and B. addresses 8c/d). but still 
retained RNA-binding activity (Fig. 2C, addresses 8c/d). In contrast, phosphorylation of i>C4 by 
casein kinase 11 stimulated the DNA-binding activity (Fig. 2A and B. addresses 6c/d). but reduced its 
RNA-binding activity (Fig. 2C. addresses 6c/d). These observations demonstrate that UPA is an 
effective method to map protein interaction domains and DNA- or RNA-binding domains of a 
protein. 
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Example 4: Interaction with a DNA probe. 
Methods and Matwiai.. 

A .., ''^''•"'^™Pl>2)»nilreprol)ed«iilia'=pl.h.i»i 
<louW«irand«l„li|onutle«id.(«bp)connmi„.,h,.. P»»«aw,«,a M,wrf 

el« °""""'"'»«*"''"»"n»n«,jo,la„M«pmKttr 

OOOGGTOOOOOCCCOTTCOTCCTCACTCTOTCCOCATCOCTOTCTCCC , 

a G-50 nick column (Pharmacia Biotech United KinoHnm^ D. "onmrough 

conUining60mMKCI2xDenhard/ .^^^^^^^ 

""^ 2^ '^•^-'hardts solution and 25 Mg/ml poly(dG dC) (Sigma, St Louis MO) at 

~.pe..eror30mi„utes.PoHnte.ctio^^ 

was added to the same buffer and incubation was earned out at 4- c for >,2 hou. Th 
autoradiography and quantification. 

*DNA„lT!r'*"''""'"*^''''™*''»^>*'''--"»»™-»f*e 
-^NAp.^.„^H«*,..,„,^„^,.„,„,,,^,^^^^^ J' 



25 Results 



The results shown in Figure 2A indicate th»t »ft u • . 

^ '^^^'"^*«=*^«^''at, after washing with 500 mM salt, 

derivatives, mcludingwild.typePC4faddr^c«o«/»,^ . «roomercL4 
position 77 rPh. P ^ ""^«. » unfile amino acid change at 

ps.t,on7(Phe-.Pro)comp,e.e,y abolished thedsDNAbindingabi.ityofPC4(addres^ 
These results are h agreement with thP„K i«»"aresscs sc/a;. 

.mo, DNA..^„5 ,„,i^ ^^^^^^ ^^^^ ^^^^^^ ^ 



10 
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DNA was barely detectable Hou,. . ^ '""y °" t° TATA box-containing 

however, many other eeneml ^„„„ 2657-2683. 1 996). On the other hand. 



E>^ample 5: Interaction with a RNA probe. 
Methods and Mat^riaic 



5 



outat4°r fnr^iok . . /3 1-/59, 1998). Interaction was carried 

C for >I2 hours .„ the p«sence of 20™M HEPES Na pH 7.9. 50/, g.ycero, ,0 . 
niercaptoethanol.0^mMEDTANaDH8 0 60mM,.r, ■> ° g'y^rol. 1 0 mM 2- 



Results 



This protein array system was also used successfiillv t« 
probe transcribed from the SV40 ear.v ' • ^" "^^^ 
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•"'n ng2A and B). Finally PCAF 



5 



Mgtho djand Material. 

I J M was added instead oftheRNA probe. 

Results 

This protein array svsfem «/, i 

Strongly 

Table 2 
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29 



30 



31 



32 



8a/b 



8c/d 



8e/f 



8g/h 



PC4-m3/bactcria 



PC4-m4/bacteria 



PC4-m5/bacteria 



PC4-m6/bacteria 



PC4-m7/bacicria 



5.4 



3.8 



2.2 



2.8 
0.6 



21.7 
31.5 



1.4 



37 



41.2 



.45 



4.5 



27.4 
37 



66.7 
56.9 



61.2 



iOO 



89.5 



87.3 



75.4 



PC4 (R27A/K28I/K29A) 



PC4 (R47N/K531/R59A) 



PC4 (F77P) 



PC4 (K29A) 



PC4 (K41A) 



PC4-wt/bacicria 



p52/bactcria « 



2.8 
2.1 



66.7 



72.7 



0.8 
1.4 



49.8 



transcriptional coactivator (wild type) 



transcriptional coactivator 



p75/bacteria • 



0 

0.7 



54.6 



transcriptional coactivator 



9g/h 



37 



p75-C/bacteria » 



lOa/b 



1.3 



38 



p300-C/bacuiovirus " 



66.2 



4.6 



lOc/d 



11.6 



40 



lOg/h 



41 



lla/b 



42 



llc/d 



PCAF/baculovirus 
PC AF-C/baculo virus " 



11.3 



14.5 



coactivator (C-terminai 326-530) 



2,5 



5.3 



2.4 
8.4 



14.5 



98.5 



transcriptional coactivator (1 135-2414) 



TAF250/baculovirus ' 



1.7 



1.1 



Topo I/bacuiovirus ^ 



8 

0.6 



74.3 



histone acetyl transfcrsc 



'PCAF(352>832r 



10.9 



Topo l/baculo virus ^ 



1.1 
4.7 



I 

2:4 



0.9 



transcriptional coactivator 



12.4 



2.1 



12.6 



DNA unwinding/transcription 



Topo 1 (Y723F) 



Topo I^acuiovi^us ^ 
Topo l/HeLa 



5 

2.3 



1.6 
1.7 



1.1 



Topo 1 (wild type) 



Hg/h 



0.7 



76.5 



native Topo 1 



ASF/SF2/bactcria • 



46 



I2c/d 



13.5 



SR/HcLa ' 



47 



12e/f 



GST-Nu/bacteria 



55 



48 1 12g/h 



100 



GST-K/bactcria 



33.5 
80 
0.4 



89 



40.4 



100 



9.7 



3.3 



8.3 



2.8 



0.4 



1.5 



1.8 



splicing factor (SR protein) 
splicing factors (SR family) 



pre-rRNA processing factor (nucleolinT 
negative control 



'Gc e/o/.. Kfethods Enzymoi, 274, 57-71, 1996 
Chiang ei aL, EMBOJ., 12. 2749-276Z 1993 

* Kcrshnar et aL J. Bioi Chem., 18. 34444-34453 1998 
'^Uoe/aA.Ce//, 71,231-241. 1992 

' Jackson and Tjian. Proc. Naii Acad. Set. USA, 86. 1 78 M 785 1 989 
Ge et aL Proc. Nati Acad. Set. USA. 91. 12691-12695 1994 ' 
*Gc et aL Moi Cell 2, 751-759. 1998 
' Ogryzko et aL Cell, 87. 953-959. 1996 

* Mizzcn etaL CetL 87. 1261-1270. 1996 

' Wang and Rocdcr. Mol. Cell, I, 749-757, 1998 

' Pourquicr ef al., J. Biol Chem., 272. 26441-26447. 1997 

* Zahlcr et aL Genes Dev., 6. 837-847. 1992 

" Valdcz et al. , Mol. Immunol.. 32. 1 207- 1 2 1 3 1 995 

The number, position (address), name/source (and related reference), affinities for each probe 
and known function for each of the 48 target polypeptides are indicated. The highest affinities of the 
individualized proteins for each probe molecule IGST-nucleoiin for p52 (addresses 12e/0, PC4-P for 
the dsDNA (addresses 6c/d). SR for the ssDNA (addresses 1 2c/d) and PC4.m4 for the RNA 
(addresses 8a^)] where nonnalized to 100 and are indicated in bold. 

Example?: Kits 



UPAs as disclosed herein can be supplied in the form of a kit for use in molecule binding 
analyses. In such a kit. at lest one polypeptide array is provided. The kit will also include 
instructions, usually written instmctions. to assist the user in probing the array. Such instructions can 
optionally be provided on a computer readable medium. 

Kits may additionally include one or more buffers for use during assay of the provided 
array. For instance, such buffers may include a low stringency, a high stringency wash, and/or a 
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stripping solution. These buffer* m« u 

Kits may in addition include eith-r a 

probe suppiWi, a,, 

*n some embodiments of the c.rr^n, • 

p«'«-«.««^*^^^t2"r*;?'*™'""''''''""^''-' 

^ ^'j'"""'* ■ "-•b.i ««p«««, 

UPAS. '^'^"^<^ aJ^ve for kits for the assaying of 
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We claim: 



I. 



A protein interaction assay comprising: 
contacting an anay of substantially pure target polypeptide molecules stably 
associated with a surface of a solid support with a detectable probe molecule under conditions 
sufficient to produce binding; and 

detecting the binding. 

2. The assay of claim 1. fiirther comprising removing unbound probe mciecuie prior 
to detecting the binding. 

3. The assay of claim 1. wherein the detectable probe molecule comprises a single- 
stranded nucleic acid, a double-stnmded nucleic acid, a protein, or a Hgand. 

4. The assay of claim 1 . wherein the detectable probe molecule comprises a tag useful 
for detection. 

5. The assay of claim 4. wherein the tag is fluorescent, luminescent, or immunogenic. 

6. "nie assay of claim I. wherein the array comprises a microarray. 

7. The assay of claim >. wherein the polypeptides are associated with the support at 
discrete addresses. 

8. The assay of claim 7. wherein each address contains only one substantially pure 
target polypeptide. 

9. The assay of claim 1, wherein the binding detected is a binding pattem. 

10. An assay to determine polypeptide-binding of a probe molecule, comprising: 

(a) preparing a labeled sample of the probe molecule; 

(b) contacting the labeled sample with an array of substantially pure target 
polypeptides stably associated with the surface of a solid support under conditions sufficiem to 
produce binding; 

(c) separating unbound labeled probe from the array to produce a probed array; and 

(d) detecting the binding. 

11. The assay of claim 1 0. further comprising contacting the probed array with at least 
one additional member of a signal producing system. 

12. The assay of claim 1 1. wheroin the contacting the probed array with at least one 
additional member of a signal producing system is prior to detecting the binding. 

13. A universal protein array, comprising 

a plurality of substantially pure target polypeptide samples provided on a solid 

support, 

wherein the samples are immobilized on the solid support in an addressable paden,. 

1 4. The array of claim 13, wherein each address contains only one substantially pure 

target polypeptide. 



19. 

20. 
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'5. TDe array of claim 13. wherein .he addresses are arranged m rows and columns 

. -n.e array of claim '3. wherein the array is arranged in a computer readable format 
>7. The array of claim '3, comprising at least 10 different polypeptide samples 
18. The airay of claim 13, comprising at least 30 different polypeptide samples 
The array of claim 13. comprising at least 100 diflferem polypeptide samples 
The array of claim 13. wherein the an^y comprises a microanay 

21. -r^'-rrayofclaim 13. wherein the solid support comprises glass, nitrocellulose, 
polyvmyhdene fluoride, nylon, fiber, or combinations thereof 

22 The array of claim 13. wherein the polypeptides comprises transcriptional fa«ors 
transcriptional activators, or transcriptional coactivators. 

f T...n ""'^^P''*^" '"-"P"^^ TFllA. TFHB. TBP 

^D^nE.TP„P.f:T.„H.Pol,,aXR.TR,Oct,. 

G4-CTF.G4-Spl.G4.ElA.G4.,E,G4.Tat,PC4-P.PC4-N.PC4-C.PC4-AS,PC4-ml PC4-m2 ' 
C4^3.PC4-m4,PC4.m5.PC4.m^^ 

TAF250. Topo I (wt), Topo I (mt). Topo 1 (wt)*. Topo 1 (nati). ASF. SR. GST-Nu. or GST-K ' 

24. Akitfordeterminii,gpolypep,ide.bindingofaprobemolecule.comprising 
a polypeptide array; and 

instructions. 

25. The kit of claim 24. wherein the instructions include directions for exposing the 
probe molecule to an anay of substantially purepolypeptidesonasuppoa^ 

he probe .olecule is capable ofbindingtooneormore of thepolypeptides of the support to detect 
b.olog,cal interactions between the probe molecule and the one or more polypeptides 

26. The kit of claim 24, wherein the polypeptide array comprises a microarray. 

27. The kit ofdaim 24, further comprising a buffer. 

28. The kit of claim 24, wherein the polypeptide array comprises a plurality of 
substantially pure polypeptide samples. 

Be kit of claim 24, funher comprising a probe molecule standard. 
Th. kit of claim 29, wherein the probe molecule standard comprises a label 
TT.e kit of claim 28, wherein the substantially pure polypeptides comprise 
transcriptional factors, transcriptional activators, or transcriptional coactivators. 

32. The kit of claim 31, wherein the polypeptides comprise TFIIA, TFIIB TBP 
™D.TTllE.TF„,f:TFllH.Po,„,RXR,m 

G4-CTF. G4.SP1. G4-E1 A. G4-IE. G4-Tat, PC4-P. PC4-N, PC4-C. PC4-AS. PC4.ml PC4-m2 
PC4.3.PC4-m4.PC4-m,PC4-m6,PC4. 

TAF250. Topo I (wt). Topo I (mt). Topo 1 (wt)-. Topo I (nati), ASF. SR, GST-Nu, or GST-K. 

33. A method of analysis of protein-molecule interactions, comprising: 
obtaining a plurality of different substantially pure protein specimens; 



29. 
30. 
31. 
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array; and 

34. 
35. 



placing a sample of each spec^en in a discrete addressable location on a «cipient 

probing the array with a detectable probe molecule. 
TTie method of claim 33. wherein the anay comprises a microarray 

''^'"^'^^'^'^'"'"^^.whereintheprobemoleculecomprisesanucleicacida 
polypeptide, a Iigand. a fragment thereof, or mixtures thereof. 

36. A method of analyzing a plurality of binding characteristics of an amy of 
polypeptide samples, comprising: . 

samples- ""'"""^ ' ""^ """"''"^ " "'""""'^ P°'y'^P''<*« 

the universal prote.n aaay to identify those samples to which the first probe binds; 
(c) detecting a first binding pattern of the first probe; 

seen . K K f """""^ ' P'^"' ''•^"''^ to which the 

second probe binds. 

37. The method of claim 36. further comprising stripping bound first probe from the 
array pnor to exposing the array to the second probe. 

38. -n,e method of claim 36. wherein the protein array comprises * 
a plurality of substantially pure target polypeptide samples; and 

a solid support, 

wherein the samples are immobilized on the solid support in an addressable pattem 

39. Ti,e method of claim 36. wherein the first probe and the second probe are selected 
from different classes of molecules. 

40. n,e method of claim 36. wherein the protein array comprises a microarray 

41. The assay of claims I orlO, wherein detection is automated. 

42. The method of claim 36. wherein detection is automated. 
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FIG. 1A 



FIG. 1B 
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FIG. 1C 
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FIG. 2A 
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FIG. 3A 
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